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I. INTRODUCTION 


The reconstruction of frequency-resolved spectra from limited and undersampled measurements in the time domain 
is a significant problem throughout the physical sciences. The standard approach to solving such a problem is the 
discrete Fourier transform, which decomposes a time series in terms of its component frequencies (or, more generally, 
decomposes a series into its conjugate domain). The discrete Fourier transform offers two major advantages: no a 
priori knowledge about the signal is required, and the computation can be implemented very efficiently via the fast 
Fourier transform. Unfortunately, a major disadvantage is that the discrete Fourier transform imposes a natural bound 
on the maximum frequency resolution possible given the nature of the time series, known as the Shannon-Nyquist 
condition [T]. 

A natural question to ask is whether the Shannon-Nyquist condition can be bypassed by exploiting any additional 
knowledge we may have about the signal. Recent advances in signal processing have provided many such techniques 
for leveraging additional a priori knowledge about the signal to improve reconstruction. Our goal in this paper is to 
compare three such methods, filter diagonalization EHZ], compressed sensing |51[5HH]. and super-resolution [niMis], 
against a series of test signals in order to understand their relative strengths and weaknesses. Our comparison will be 
based on a subset of the signals contained in the Sparco toolbox [201, a Gaussian, a sum of random Lorentzians, and 
the Jacob’s Ladder signal |2]. The Sparco toolbox provides a standard set of signal processing benchmarks while the 
other signals are commonly encountered throughout the physical sciences. 

Filter diagonalization, one of the earliest techniques for bypassing the Shannon-Nyquist condition, assumes that 
the time series is generated by an underlying dynamical system with a frequency spectrum modeled by a sum of 
Lorentzians. It attempts to express the frequency spectrum as a sum of Lorentzian peaks by finding the optimal 
frequencies, linewidths, and intensities that fit the time series. Filter diagonalization has been applied to a broad 
range of signals that vary from NMR spectra miS] to scattering data |S] and image analysis g]. 

More recently, Ci minimization techniques, such as compressed sensing and super-resolution, have also been pro¬ 
posed as an alternative technique for sampling below the rate imposed by the Nyquist-Shannon condition. Rather than 
assuming a particular kind of underlying dynamical system, these techniques simply assume that the signal is sparse 
in some a priori known basis. The two methods differ in both sampling strategy and the particular optimization prob¬ 
lem to be solved. Compressed sensing is designed to recover sparse frequency spectra (or other signals) by randomly 
undersampling data over the entire time domain, and then minimizing the Ci norm of an underdetermined system of 
linear equations. Compressed sensing has been successfully applied to data acquisition in many different areas EJ, 
including the improvement of the resolution of medical magnetic-resonance imaging |22| and the experimental study 
of atomic and quantum systems muniEa]. 

Super-resolution is a related technique that shares the spirit of compressed sensing, but with a different sampling 
technique i3i lEHini na Ea EiHsa. Super-resolution was developed to recover sparse frequency spectra (or other 
signals) from regularly sampled data over a short segment of the time domain. It provides a provably convergent 
algorithm for the reconstruction of signals from these limited time-domain measurements by using a total-variation 
minimization procedure. Like compressed sensing, super-resolution has been applied to a broad range of scientific 
problems, including image |18| and video compression |29| . image denoising |30| . atomistic modeling of open quantum 
systems m, astronomy HZ] , microscopy m, and medical imaging m- 

The goal of this paper is to elucidate the strengths and weaknesses of the aforementioned signal processing techniques 
to provide a clear and coherent aid in choosing a method. To achieve this, we will first introduce the theory that 
underlies each method and outline our procedure for benchmarking the methods. Then, we will introduce the test 
signals and compare the performance of each method on each signal. Finally, we will present some general conclusions. 


II. THEORY 


A. Discrete Fourier Transform 


The Fourier transform is a cornerstone method in signal processing, as it provides a technique for decomposing an 
arbitrary function of time into its component frequencies: 
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When treating a problem numerically, we often only have access to the values of the signal f{t) on an equally-spaced 
Wpoint grid. Accordingly, we discretize the continuous Fourier transform to obtain the discrete Fourier transform: 

^ 3 

which can be reformulated as a matrix multiplication according to: 

h = (n.3) 
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where fj = f(tj), fi = f(oJi), and is the Fourier operator. Here we have assumed a uniform frequency grid 
with a spacing of fs/N, a time sampling rate of At, the maximum frequency that can be sampled is 1/At, N is the 
number of time points, and T is the time length of the signal. The Nyquist-Shannon sampling theorem states that 
if a function is band limited with maximum frequency ft, it is completely characterized with a uniform series of time 
points spaced by l/2f2. It is often more convenient to use the converse statement in signal reconstruction, which 
claims that the with a sampling rate of At the maximum frequency that can be recovered is 1/2At. This is a direct 
consequence of discretizing the Fourier transform. A major disadvantage of the discrete Fourier transform is that a 
long and uniformly-sampled time series is required to obtain good resolution in the frequency domain [T1 132|. 


B. jCi Minimization 

Cl minimization methods, including compressed sensing and super-resolution, have emerged as a powerful technique 
for bypassing the constraint of the Shannon-Nyquist theorem in the special case where the signal is known to be sparse 
in a particular basis |K1 ITT?]. 

To illustrate this, suppose we have an unknown function f{t) that we wish to recover with as few samples as 
possible. Suppose further that we can find a set of basis functions {gi{t)} such that f{t) is sparse when expanded in 
this basis. That is. 


/(t)=^A,g,(t), (II.4) 
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where most of the \j expansion coefficients are equal to zero (or near zero). Our goal is to find the set of coefficients 
{Aj}, since this would in turn identify the function f{t). All we know a priori is that most of the Xj are zero; we do 
not know which of them are zero, and we do not know their values in general. By sampling f(t) at a set of points 
{ti}, we can obtain a set of linear equations. 
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where fi = fifi) and pij = gj{ti), and our goal is to solve these equations for the set of coefficients {Aj}. While 
the t variable suggests discretization in time, we are free to sample the signal in any domain. Since we are trying 
to obtain accurate resolution by taking as few time samples as possible, in general this system of equations will be 
underdetermined and we must impose additional constraints to pick out the desired solution. 

In Cl optimization methods, including compressed sensing and super-resolution, the desired solution to the under¬ 
determined system of equations |II.5|is chosen by solving the following Ci minimization problem: 
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In this minimization problem, the Ci norm serves as a proxy for the 
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where rj is a small thresholding parameter. 

sparsity-enforcing Cq norm by selecting the sparsest set of coefficients {Aj} such that the system of equations II. 5 
satisfied to within t]. It is important to note that each gj{t) should be normalized to unity so that no single basis 
function is privileged. 

Compressed sensing and super-resolution differ in the sampling strategy, which, in turn, is often determined by 
computational and experimental constraints. Compressed sensing addresses the case where the value of the function 
f{t) is sampled at random points {t^} over the entire domain. This random sampling of points fi ensures that 
each point provides the maximum possible amount of information for the reconstruction of the signal. A key result 
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from compressed sensing is that the number of time samples fi which must be measured for accurate recovery scales 
roughly with the sparsity of the basis expansion (i.e. the number of nonzero Xj), rather than the total size of the 
basis expansion (i.e. the total number of A_,) p, I32|. 

A related method to compressed sensing is super-resolution. Unlike compressed sensing, which applies to randomly- 
sampled data, super-resolution applies to data that is regularly sampled on a short segment of the time domain. It 
has been proven that super-resolution enables the recovery of signals with frequencies at one quarter of the Shannon- 
Nyquist condition reliably m- 

A major advantage of both compressed sensing and super-resolution is that we can recover f(t) in any basis in which 
the signal is sparse. The methods work with bases as varied as wavelets [Ml 184] . treelets [35], geometric harmonics |86| . 
and polynomials dllSTl. All that is required is that we know the sparse basis ahead of time. Although this may 
seem like a strong restriction, for many scientific problems physical intuition often leads to a sparse basis. One does 
not need to pick the optimal basis; any reasonably sparse basis will work. Moreover, both compressed sensing and 
super-resolution are robust to choosing an overcomplete basis, which allows for a lot more freedom in finding a sparse 
basis. 


For example, in computational chemistry, we are often interested in resolving spectra which are known to be sparse 
directly in the frequency domain (i.e. the spectrum is mostly zero except for a few sharp frequency peaks). In this 
case, we might choose a basis of complex exponentials gj{t) = After time sampling, the matrix g^j = 

simply becomes an undersampled set of rows of the discrete Fourier transform matrix. Once the sparse coefficients 
Xj have been found by solving eq. II.6 the final spectrum may be plotted as 


/(w) = Ajgj (a;) = Y > 
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Other similar bases commonly used when applying compressed sensing or super-resolution to Fourier analysis are sine 
functions gj{t) = ^ sin (ojjt) and cosine functions gj{t) = ^ cos {ojjt). 

To take another common example in the physical sciences, we often find damped oscillatory signals which may be 
expressed as a sum of damped cosines: 


gjkit) = e cos (ujjt). 


(II.8) 


Compressed sensing and super-resolution are easily adapted to this overcomplete basis and, once the sparse coefficients 
Xjk have been found via eq. [11.6 the final spectrum may be plotted as 


= Y ^jkdjkiuj) 
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In short, compressed sensing and super-resolution both enable the recovery of an undersampled signal by using 
a customized, sparse basis that is appropriate to the problem at hand. The choice of technique typically depends 
on which sampling method is easier to perform: random sampling over the entire time domain is appropriate for 
compressed sensing, while regular sampling over a short part of the time domain is appropriate for super-resolution. 


C. Filter Diagonalization 


Filter diagonalization is another approach to circumvent the Shannon-Nyquist condition. Inspired by quantum 
mechanics, the method assumes that the signal f{t) to be recovered is generated by the time evolution of a unitary 
propagator. 


/(t) = (4>o,e-*^*<i>o). (II.IO) 

If we sample f(t) on an equally-spaced grid = nr, we can discretize this equation as 

/(t„) = ($o,e-™^"$o). 


( 11 . 11 ) 
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where U = e is the unitary propagator. By expanding the propagator in terms of its eigenvalues and (possibly 
complex) eigenvectors, 


( 11 . 12 ) 
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and substituting this expansion into III.lT] we obtain 

/(t„)=^|(u„ci>o)Pe-“^"^ (11.13) 
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which is the equation for a Lorentzian signal with (possibly damped) frequencies Wj and amplitudes \j = \{uj^ ‘l’o)P- 
Hence, resolving a signal f{t) into a sum of Lorentzian peaks is reduced to the standard linear algebra problem of 
finding the eigenvalues and eigenvectors of the propagator IJ = 

The key insight of filter diagonalization is that the propagator to be diagonalized, may be expressed entirely 

in terms of time samples of the signal /„ = /(tn)- A common approach is to write the propagator in a so-called 
Krylov basis. 




E 


n=0 



(11.14) 


where Zk = is a complex value chosen along the unit circle. By selecting the Vk close to the frequencies we 

wish to resolve, it is possible to filter f{t) and recover only those frequency components near the Vk', this is where the 
name filter diagonalization comes from. It is important to include more basis vectors I'kfe) than there are frequencies 
we wish to resolve. 

Expressing the propagator in the Krylov basis yields 

N N 

Ukk> = {^k,U^k') = E E fn+n' + lZ^Z-r' , (11.15) 

n —0 n '—0 

which is expressed completely in terms of time samples of the signal. Because the Krylov basis is not orthonormal, 
we also need the overlap matrix 

N N 

Skk' = ('kfc, ^ fn+n'Z^-Z^r\ (11.16) 

n—Q n'—O 


after which the eigenvalues and eigenvectors of U may be found by solving the generalized eigenvalue problem, 


UBj = UjSBj. 


(11.17) 


For computational efficiency, the double sums in eqs. 11.15 and 11.16 are rewritten as single sums, as shown in |3]. 
With the eigenvalues Uj and eigenvectors 


Uj — Bkj'^k 
k 


(11.18) 


in hand, the frequencies coj and amplitudes Xj in the signal f{t) may be reconstructed according to the formulas 


Uj = e and 

Xj = |(u^-,4>o)|^ = 


2 




(11.19) 

( 11 . 20 ) 


Because the Krylov basis is often close to becoming linearly dependent, we include a numerical conditioning step 
to remove possible spurious frequencies. In particular, we select a value of p and resolve the generalized eigenvalue 
equation with [/p+i and IJp (used in place of U and S). We remove the eigenvalues that are not shared in the two 
spectra, and then select a filtering grid with frequencies Vj located only at the nonspurious eigenvalues. We rerun 
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filter diagonalization one more time on this adaptive frequency grid |7], and these are the results we report below. 

While this technique was initially derived with quantum mechanics in mind, it is not limited to such applications. 
Indeed, with generalizations such at 2D filter diagonalization and multi-resolution filter diagonalization, the method 
has been expanded to be applicable to a broad range of signals. 


III. METHODS 

As our goal in this paper is to compare the performance of compressed sensing, super-resolution, and filter di¬ 
agonalization in recovering sparse signals, we began by obtaining a series of sparse signals from the Sparco toolkit, 
which is a well-known set of sparse signals used for benchmarking various signal processing techniques [50]. We also 
generated a few other signals of interest to highlight particular properties of each technique. Unless otherwise stated, 
each signal began as a continuous function of time f{t) and, to generate a discrete time series, we sampled f{t) at 
4096 time points ranging uniformly from t = 0 to t = 1 second. This gave a grid separation of 1/4096 seconds, with 
a maximum recoverable frequency of 2048 Hz. 

For each sparse signal processing method, we varied how many of the 4096 time points we sampled (in increments of 
64) and investigated the dependence of the recovery error on the extent of undersampling. As a measure of the recovery 
error, we employed the relative 2-norm error over all 4096 time points (regardless of the extent of undersampling): 

Recovery Error = I/recovered (tQ - /original (t^)P 

l/origi„al(U)P 

We consistently obtained similar results with the 1-norm error and the oo-norm error, but the 2-norm error has the 
advantage that, by Parseval’s theorem, it is the same whether it is measured in the time domain or the frequency 
domain. Therefore, we adopted the 2-norm as our primary benchmark. In some instances, filter diagonalization has 
been marketed as a parameter estimation technique, but this problem is identical to reconstruction of the signal as 
a whole. That is, if a method can extract the characteristic parameters of the signal, then these parameters can be 
used to reconstruct the signal. 

For compressed sensing and super-resolution, we attempted to recover each signal in an appropriate sparse basis. 
The basis used depends on the signal and is discussed in the individual sections below. Because super-resolution 
requires a grid of equally spaced sample points, we began our analysis by examining the full signal and computing the 
errors. We then repeated our analysis for the signal by successively undersampling in powers of two, taking care to 
ensure that our sample points were always equally spaced. In contrast, our analysis with compressed sensing involved 
randomly selecting the same number of points that were included in the super-resolution analysis at each step. 

For filter diagonalization, we used the same regular sampling strategy as for super-resolution, and we monitored 
the recovery error as a function of the sampling. Filter diagonalization requires specifying a grid of frequencies on 
which we expect the components of the signal to lie, so we specified a frequency range of 0 kHz to 20 kHz. To find the 
appropriate grid density and number of frequencies, we tuned these two parameters for optimal reconstruction with 
the full time signal and assumed these parameters would be valid for the entire numerical experiment. To ensure the 
robustness of our results, we also performed the analysis from 0 to 5, 10, 50, 100, 200, and 500 kHz, with a similar 
density of frequencies. 

From a numerics standpoint, compressed sensing and super-resolution require a fast, memory-efficient Ci solver. 
For all results in this paper, we implemented the two step iterative shrinkage/thresholding (TwIST) algorithm in 
Python |88l 189] . Our TwIST solver is capable of solving arbitrary optimization problems given a measurement 
matrix, signal vector, and objective function, and gives numerically identical answers to the Matlab version for a wide 
range of test signals, including all those in this paper. 

We implemented filter diagonalization in Python, performing all required matrix diagonalizations using the zgeev 
function from LAPACK. We benchmarked our implementation against Harminv, a freely-available CH—h implementa¬ 
tion based on the methods described in |7|, and found that they give the same answers to within numerical precision 
for a wide range of signals, including all of those presented in this paper. 


A. Gaussian Signal 


We begin with one of the most ubiquitous signals throughout signal processing, a simple Gaussian centered at time 
t = 0 with a = 0.4 (Fig. la). 


/(t) = e ST! 


(HI.2) 
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Figure 1: (a) Time series consisting of a Gaussian given by Equation (IIL2I centered at t = 0 with standard 
deviation a = 0.4. (b) Comparison of the relative 2-norm error in the reproduction of a Gaussian signal as a 
function of undersampling betweening compressed sensing, super-resolution, and filter diagonalization. 


To recover this signal with compressed sensing and super-resolution, we employ a basis of displaced Gaussians 

9jkit) = e~'~^, (IIL3) 

with 100 centers tj ranging uniformly from 0 to 1, and 100 standard deviations ak also ranging uniformly from 0 to 
1, for a total of 10,000 different basis functions. It is clear that the function we hope to recover, f{t), is sparse in this 
basis. 

Fig. |lb| compares the performance of compressed sensing, super-resolution, and filter diagonalization in recovering 
the Gaussian signal. Compressed sensing and super-resolution both converge quickly to the correct signal, and as more 
time-domain information is sampled, the signal becomes more obviously composed of a single Gaussian. Moreover, 
compressed sensing converges more quickly than super-resolution, indicating that randomly sampling over the entire 
time domain provides more complete information about the overall shape of the signal than sampling uniformly with 
a coarse grid. Both compressed sensing and super-resolution recover a single strongly converged, correct, peak with 
amplitude 1 and a few spurious peaks with amplitudes smaller than 10“®. This represents a small numerical instability 
in our implementation of TwIST but these spurious features are easy to identify and disregard. 

By contrast, filter diagonalization fails to converge completely because it attempts to recover the Gaussian as a 
sum of Lorentzian peaks, rather than taking advantage of the natural sparsity of the signal in a Gaussian basis. 
This example highlights the basis set agnosticism of the Ci minimization techniques, which is one of their principal 
advantages. 


B. Sparco Problem 1 


For our second signal, we consider a sinusoid that is “disrupted” by two Heaviside step functions (Fig. 2aI, 


f{t) = 4sin(47rt) — 0(t — 0.3) — 0(0.72 — t). 


(III.4) 


one of the earliest signals used to benchmark wavelet and compressed sensing techniques iHiiiniiii]- This signal is 
Problem 1 in the Sparco toolbox of sparse signals [20]. 

To recover this signal via compressed sensing and super-resolution, we employ a composite basis of sine functions 
and Heaviside step functions 


9jit) = sin(wjt) 
hj{t) — Q{t — tj), 

with the spectral spacing uij of the sine functions ranging uniformly from 0 to 40967r kHz in units of kHz, and the 
unit steps tj of the Heaviside step functions ranging uniformly from 0 to 1 second in units of 0.01 seconds. While 
either basis gj{t) or hj(t) by itself would provide a complete basis for recovery of the signal f{t) (to within numerical 
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Figure 2: (a) The combination of a sinusoid and a Heaviside signal in the time domain as given by Equation (IIL4I. 

(b) Comparison of the 2-norm error in the reproduction of Problem 1 from the Sparco toolbox as a function of 
undersampling between compressed sensing, super-resolution, and filter diagonalization. We attribute the large spike 
in error by filter diagonalization to the recovery of spurious exponentially divergent solutions. 
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precision), the function f{t) would not be sparse in either basis on its own. On the other hand, there is no problem in 
Cl minimization techniques with using the combined basis, which affords the additional advantage that f(t) is sparse 
in this combined basis. However, when building a composite basis with different functional forms, it is important to 
ensure that each basis function is normalized to the same value, for which we chose unity. 

Fig. |2b| compares the performance of compressed sensing, super-resolution, and filter diagonalization in recovering 
the signal f{t). For super-resolution and filter diagonalization, both of which involve regular sampling over a coarse 
time domain grid, most of the initial error simply comes from the fact that the methods cannot reproduce aspects of 
the signal that have not been sufficiently sampled. 

Super-resolution and compressed sensing are able to identify that the signal has some underlying sine structure, but 
initially fails to recognize the exact position of the step functions. As more samples are included in the analysis, both 
techniques are able to quickly converge to the exact location of the step function. This convergence leads to a very 
sharp phase transition characteristic of an Ci analysis, and represents the minimum amount of information required 
to exactly reproduce the full signal. This phase transition is a well known aspect of Ci minimization techniques, and 
provides a useful and valid check on convergence and accuracy. 

This stands in contrast to filter diagonalization, which attempts to match the Heaviside step functions by creating 
a signal that contains exponentially growing components, eventually resulting in an explosion of error. To better 
understand this behavior, we varied the magnitude of the Heaviside step functions, but found that the creation of 
an exponentially growing signal persisted even when the Heaviside step function was 0.1% of the amplitude of the 
oscillating sine wave. 

Not surprisingly, compressed sensing fares better than both super-resolution and filter diagonalization. This is easily 
explained by the fact that compressed sensing randomly samples the entire domain, so it can quickly “recognize” all 
features of the signal and recover them accurately. After roughly one sixty fourth of the signal has been sampled, the 
error changes only marginally, and this effect is robust across different runs of random sampling. 


C. Sparco Problem 5 


For our third signal, we consider the sum of three cosines with the addition of 40 spikes at random time points {ti} 

(Fig.R, 


f{t) = 2cos(27rt) -I- 3cos(97rt) — cos(207rt) 
+ y^^aiS{t - tj), 

i 


(HL5) 


where 6{t — ti) is regarded here as the Kronecker delta function (equal to 1 at the time point t = ti, 0 otherwise) 
and tti is a uniform random number between 0 and 1. This signal is Problem 5 in the Sparco toolbox of sparse 
signals |20| . and including it in our comparison is particularly useful for benchmarking the ability of compressed 
sensing, super-resolution, and filter diagonalization to deal with random noise. 

To recover this signal via compressed sensing and super-resolution, we employ a cosine basis: 


gj{t) = cos {ojjt) 


with the spectral spacing ojj of the cosine functions ranging uniformly from 0 to 40967r kHz in units of ^ kHz. Note 
that we do not include Kronecker delta functions 6{t — ti) in our basis, since our goal is to see whether our signal 
processing methods can recover the underlying cosine functions despite the random noise. 

As shown in Fig. 3b both compressed sensing and super-resolution successfully recover the underlying cosine 
functions in spite of the noise peaks (the noise peaks simply get absorbed into the denoising parameter rj). As 
expected, compressed sensing recovers the signal with less sampling than super-resolution, since randomly sampled 
points over the entire time domain effectively contribute more information than regularly sampled points over that 
same time domain. 


In contrast, filter diagonalization does not include a robust denoising procedure, and the method struggles with 
the Kronecker delta peaks because they represent sharp deviations from the underlying cosine signal. In particular, 
in attempting to match the Kronecker delta peaks, filter diagonalization creates a signal that contains exponentially 
growing components rather than exponentially damped Lorentzians. 

In summary, compressed sensing and super-resolution both pick out the underlying cosine signals by denoising the 
Kronecker delta peaks, whereas filter diagonalization does not. 
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Figure 3: (a) The combination of three cosines and random spikes in the time domain as described in Equation 
(III.Sl. (b) Comparison of the 2-norm recovery error for the Problem 5 from the Sparco toolbox as a function of 
undersampling for compressed sensing, super-resolution, and filter diagonalization. We attribute the large error 
peaks found by filter diagonalization to the recovery of spurious exponentially divergent solutions. 
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Figure 4: (a) The Jacob’s Ladder signal, a combination of many Lorentzian peaks, in the time domain as given by 
Eq. (IIL 6 I. (b) Comparison of the 2-norm error for the reproduction of the Jacob’s Ladder signal as a function of 


undersampling for compressed sensing, super-resolution, and filter diagonalization. We have no explanation clear for 
why our implementation of filter diagonalization fails to reproduce this signal, and we note that further work may be 

necessary. 


D. Jacob’s Ladder 

Next, we consider a time series devised by some of the original developers of filter diagonalization for benchmarking 
sparse signal processing techniques |5]. This signal is known as Jacob’s Ladder, and it consists of a very large number 
of Lorentzian peaks (Fig. [4a]): 


49 


/- = E 


— 1.8m7r2xl0 "**71 


[cos(1.8TO7r2 X 10"‘‘2500n) 


(IIL6) 


m—0 


cos(1.8m7r2 x 10 ^2487.5n) -|-cos(1.8m7r2 x 10 ^2475.On)] 


In this experiment, we created a signal with 1000 data points that ranged, uniformly, from 0 to 1 seconds using the 
formula in Equation (IIL6). We first analyzed this signal with a numerical implementation of filter diagonalization 


using a frequency range of 0 kHz to tt kHz and assumed that we had a maximum of 4500 frequencies in our signal. 

We started by performing the filter diagonalization analysis with the above frequency grid using the entire signal, 
and used the recovered frequencies and expansion coefficients to construct the recovered signal. This allowed us to 
compute the 2-norm error between the recovered signal and the exact signal over the entire range. We then repeated 
the analysis, while successively undersampling, first taking every second point, then every fourth, fifth, eighth, tenth, 
twentieth, twenty-fifth, fortieth, fiftieth, and finally, hundredth. We then performed the same analysis with super¬ 
resolution and compressed sensing. Because compressed sensing involved random sampling, we took care to randomly 
sample the same number of points that were included in the super-resolution analysis at each step. 

In order to perform the Ci analysis we examined the functional form and the signal itself and concluded that it 
should be sparse in the basis of damped oscillators. Thus we constructed such a basis with a spectral spacing of 0.01 
Hz and a maximum of 47r Hz. We also scanned exponential decay parameters ranging from 0 to tt Hz in steps of 0.01 
Hz. For the super resolution analysis, we performed the same time addition procedure that was performed with filter 
diagonalization. We began the compressed sensing analysis by picking 50 random time points, and at each subsequent 
step an additional 50 random points were taken from those remaining until we were sampling the full signal. 

From the errors given in Figure |4b[ both compressed sensing and super resolution converge to a better answer 
more rapidly than filter diagonalization. We attribute these errors to the recovery of exponentially divergent solutions 
but we have no way of accounting for the difference between our results and those obtained elsewhere. Our working 
theory is a sensitivity of filter diagonalization to the frequency grid and parameter choice, and we have not found 
the correct combination of parameters that allows us to completely recover the desired signal. This suggests that our 
implementation, as well as our standard of comparison suffered from significant numerical instability. We are unsure 
whether this is explained better by fundamental instabilities in the method at hand, or simply instabilities in the 
current implementations. 

By contrast, super-resolution and compressed sensing do not suffer from these same problems. Many current meth- 
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Figure 5: (a) Combination of twenty random damped cosines as described in Eq. ( III.7| . (b) Comparison among 
compressed sensing, super-resolution, and filter diagonalization of the 2-norm error in the reproduction of the 
combination of random damped cosines as a function of undersampling. We attribute the large error peaks found by 
filter diagonalization to the recovery of spurious exponentially divergent solutions. 


ods are extremely stable. These techniques only include the basis functions that are explicitly chosen. Unfortunately, 
we are limited to the recovery of Lorentzian parameters (frequency and line-width), that are on the grid, which re¬ 
quires a sufficiently dense set of parameters for accurate recovery. As a result, -optimization methods become more 
and more memory intensive as the number of basis functions increases. 


E. Sum of Random Lorentzians 


For the final comparison, we created a sum of twenty random damped cosines: 


m = 


20 

E' 


,-7rri 


COSCUnt 


(III.7) 


with 7 „ drawn from a uniform random distribution ranging from 0 to 20 Hz and ujn ranging from 0 to SOtt Hz. This 
type of autocorrelation signal is ubiquitous not only in chemistry applications, but in signal processing at large. 

To recover this signal via compressed sensing and super-resolution, we employ a basis of damped cosine functions: 

9jk{t) = cos {ojjt). 

Here, the spectral spacing ojj ranges uniformly from 0 to 407r Hz in steps of 7r/24 Hz and the damping parameters jk 
ranges uniformly from 0 to 20 Hz in steps of 1/2 Hz. For filter diagonalization, we selected a frequency range from 0 
to IOOtt Hz and chose a basis of 1200 frequencies. This was chosen because it gave near perfect reconstruction of the 
full signal, while bases smaller than this were prone to numerical instability. 

As shown in Fig. both compressed sensing and super-resolution successfully recover the underlying damped 
cosine structure but is restricted to the functions on the grid. This is the most significant source of error. As 
expected, compressed sensing recovers the signal with less sampling than super-resolution, since randomly sampled 
points over the entire time domain effectively contribute more information than regularly sampled points over a short 
time. 

In contrast, filter diagonalization can recover off-grid frequencies extremely efficiently. Because of this, the final 
errors should be smaller than the errors from both compressed sensing and super resolution. Unfortunately, we 
encountered significant stability issues during many of our decompositions that resulted in exponentially growing 
solutions. These solutio ns g ave £2 errors on the order of 10"*^ at times (note that we have chosen to only plot a few 
orders of magnitude in 5b ) While filter diagonalization is capable of giving a much better answer, the technique is 


significantly more sensitive to slight deviations in the operational parameters chosen. 
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IV. CONCLUSIONS 


In conclusion, we have performed a broad comparison of three different signal processing techniques that attempt to 
“beat” the Shannon-Nyquist limit. With prior information about a reasonable basis for your signal, Ui minimization 
techniques provide a robust and faithful reproduction of the signal. We emphasize that the difference between super¬ 
resolution or compressed sensing is simply a choice of sampling procedure and normally is determined by the data 
acquisition technique. 

Additionally, we found that if the signal at hand was Lorentzian, filter diagonalization was capable of significantly 
outperforming both compressed sensing and super resolution because of its ability to sample off the grid. Even still, 
the technique was sensitive to a broad range of parameters, which were capable of making it divergent if chosen 
incorrectly. Given enough tuning and the appropriate signal form, however, filter diagonalization is the superior 
method for these types of signals. 
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